AITopics | optimal control law

Thompson sampling (TS) is an effective method for exploring parametric uncertainties and can therefore be used for active learning-based controller design. However, TS relies on finite parametric representations, which limits its applicability to the more general spaces that are commonly encountered in control system design. To address this issue, this work proposes a parameterization method for control law learning using reproducing kernel Hilbert spaces and designs a data-driven active learning control approach. Specifically, the proposed method treats the control law as an element of a function space, allowing control laws to be designed without imposing restrictions on the system structure or the form of the controller. A TS framework is proposed to explore potential optimal control laws, and convergence guarantees are provided for the learning process. Theoretical analysis shows that the proposed method learns the relationship between control laws and closed-loop performance metrics at an exponential rate, and an upper bound on the control regret is also derived. Numerical experiments on controlling unknown nonlinear systems validate the effectiveness of the proposed method.
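
For readers unfamiliar with TS, the finite parametric setting mentioned in the abstract can be illustrated with a minimal sketch. It is a textbook Beta-Bernoulli bandit, not the kernel-based method of the paper; the function name `thompson_sampling`, the arm probabilities, and the round count are all hypothetical choices made for illustration.

```python
import random

def thompson_sampling(true_probs, rounds=2000, seed=0):
    """Beta-Bernoulli Thompson sampling over a finite set of arms.

    true_probs are the (unknown to the learner) success probabilities;
    they are hypothetical values used only to generate rewards.
    """
    rng = random.Random(seed)
    k = len(true_probs)
    successes = [0] * k
    failures = [0] * k
    pulls = [0] * k
    for _ in range(rounds):
        # Draw one sample per arm from its Beta(1 + s, 1 + f) posterior.
        samples = [rng.betavariate(1 + successes[a], 1 + failures[a]) for a in range(k)]
        # Act greedily with respect to the sampled parameters.
        arm = max(range(k), key=samples.__getitem__)
        reward = 1 if rng.random() < true_probs[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls

pulls = thompson_sampling([0.2, 0.5, 0.8])
```

Each arm here is described by a single scalar parameter, which is the kind of finite representation the abstract says does not extend directly to control laws living in a function space.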

artificial intelligence, control law, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2506.22186

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > Canada > British Columbia > Vancouver Island > Capital Regional District > Victoria (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Energy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Functional role of synchronization: A mean-field control perspective

Mehta, Prashant, Meyn, Sean

arXiv.org Machine Learning | Feb-1-2025

Our friend and mentor Peter Caines has, together with his colleagues, created new foundations for studying collective dynamics in complex systems. Of particular inspiration to us has been his pioneering work in mean-field games (MFGs) launched two decades ago [10, 24, 25], and the related field of mean-field control. Peter pointed the way to both formulate and solve the problem of collective dynamics arising in a large population of heterogeneous dynamical systems. In this paper we survey some elements of MFGs within the context of controlled coupled oscillators. We begin by introducing a model for a single oscillator: dθ(t) = (ω + u(t)) dt + σ dξ(t), mod 2π, (1) where θ(t) ∈ [0, 2π) is the phase of the oscillator at time t, ω is the nominal frequency in radians per second, {ξ(t) : t ≥ 0} is a standard Wiener process, and u(t) is a control signal whose interpretation depends on the context. Unless otherwise noted, the SDEs are interpreted in their Itô form.
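
The single-oscillator model (1) can be simulated with a short Euler-Maruyama sketch. This is only an illustration under stated assumptions: the function name `simulate_oscillator`, the step size, and the parameter values are hypothetical and do not come from the paper; the discretization is the standard one for an Itô SDE with constant noise intensity.

```python
import math
import random

TWO_PI = 2.0 * math.pi

def simulate_oscillator(omega, sigma, u, theta0=0.0, dt=1e-3, steps=1000, seed=0):
    """Euler-Maruyama path of d(theta) = (omega + u(t)) dt + sigma d(xi), mod 2*pi."""
    rng = random.Random(seed)
    theta = theta0 % TWO_PI
    path = [theta]
    for k in range(steps):
        t = k * dt
        # Wiener increment over one step: zero mean, variance dt.
        d_xi = rng.gauss(0.0, math.sqrt(dt))
        # Drift plus diffusion, then wrap the phase back onto the circle.
        theta = (theta + (omega + u(t)) * dt + sigma * d_xi) % TWO_PI
        path.append(theta)
    return path

# Uncontrolled oscillator (u = 0) with hypothetical parameter values.
path = simulate_oscillator(omega=1.0, sigma=0.1, u=lambda t: 0.0)

# With sigma = 0 the phase simply advances at the nominal frequency.
noise_free = simulate_oscillator(omega=1.0, sigma=0.0, u=lambda t: 0.0)
```

Passing the control as a function of time keeps the sketch open-loop; a feedback law would instead take the current phase as its argument.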

artificial intelligence, machine learning, oscillator, (19 more...)

arXiv.org Machine Learning

2502.0059

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry:

Energy > Power Industry (1.00)
Energy > Renewable (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Game Theory (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings

Bevanda, Petar, Hoischen, Nicolas, Sosnowski, Stefan, Hirche, Sandra, Houska, Boris

arXiv.org Machine Learning | Jul-23-2024

This paper proposes a fully data-driven approach for optimal control of nonlinear control-affine systems represented by a stochastic diffusion. The focus is on the scenario where both the nonlinear dynamics and stage cost functions are unknown, while only the control penalty function and constraints are provided. Leveraging the theory of reproducing kernel Hilbert spaces, we introduce novel kernel mean embeddings (KMEs) to identify the Markov transition operators associated with controlled diffusion processes. The KME learning approach seamlessly integrates with modern convex operator-theoretic Hamilton-Jacobi-Bellman recursions. Thus, unlike traditional dynamic programming methods, our approach exploits the "kernel trick" to break the curse of dimensionality. We demonstrate the effectiveness of our method through numerical examples, highlighting its ability to solve a large class of nonlinear optimal control problems.

denote, operator, optimal control, (15 more...)

arXiv.org Machine Learning

2407.16407

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Hungary > Budapest > Budapest (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.87)

Neighboring Extremal Optimal Control Theory for Parameter-Dependent Closed-loop Laws

Rai, Ayush, Mou, Shaoshuai, Anderson, Brian D. O.

arXiv.org Artificial Intelligence | Dec-7-2023

This study introduces an approach to obtain a neighboring extremal optimal control (NEOC) solution for a closed-loop optimal control problem, applicable to a wide array of nonlinear systems and not necessarily quadratic performance indices. The approach involves investigating the variation incurred in the functional form of a known closed-loop optimal control law due to small, known parameter variations in the system equations or the performance index. The NEOC solution can formally be obtained by solving a linear partial differential equation, akin to those encountered in the iterative solution of a nonlinear Hamilton-Jacobi equation. Motivated by numerical procedures for solving these latter equations, we also propose a numerical algorithm based on the Galerkin algorithm, leveraging the use of basis functions to solve the underlying Hamilton-Jacobi equation of the original optimal control problem. The proposed approach simplifies the NEOC problem by reducing it to the solution of a simple set of linear equations, thereby eliminating the need for a full re-solution of the adjusted optimal control problem. Furthermore, the variation to the optimal performance index can be obtained as a function of both the system state and small changes in parameters, allowing the determination of the adjustment to an optimal control law given a small adjustment of parameters in the system or the performance index. Moreover, in order to handle large known parameter perturbations, we propose a homotopic approach that breaks down the single calculation of NEOC into a finite set of multiple steps. Finally, the validity of the claims and theory is supported by theoretical analysis and numerical simulations.

control law, equation, optimal control law, (14 more...)

arXiv.org Artificial Intelligence

2312.04733

Country:

Oceania > Australia (0.04)
North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Systems and Facilities > Geothermal System for Power Generation > Advanced Geothermal System (AGS) (0.83)

Technology:

Information Technology > Control Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

Compositionality of optimal control laws

Neural Information Processing Systems | Apr-6-2023, 13:51:24 GMT

We present a theory of compositionality in stochastic optimal control, showing how task-optimal controllers can be constructed from certain primitives. The primitives are themselves feedback controllers pursuing their own agendas. They are mixed in proportion to how much progress they are making towards their agendas and how compatible their agendas are with the present task. The resulting composite control law is provably optimal when the problem belongs to a certain class. This class is rather general and yet has a number of unique properties - one of which is that the Bellman equation can be made linear even for non-linear or discrete dynamics.

bellman equation, compositionality, optimal control law, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence (0.66)
Information Technology > Control Systems (0.65)

Error-free approximation of explicit linear MPC through lattice piecewise affine expression

Xu, Jun, Lou, Yunjiang, De Schutter, Bart, Xiong, Zhenhua

arXiv.org Artificial Intelligence | Jul-20-2022

In this paper, the disjunctive and conjunctive lattice piecewise affine (PWA) approximations of explicit linear model predictive control (MPC) are proposed. The training data are generated uniformly in the domain of interest and consist of state samples and the corresponding affine control laws, from which the lattice PWA approximations are constructed. Re-sampling of data is also proposed to guarantee that the lattice PWA approximations are identical to the explicit MPC control law in the unique order (UO) regions containing the sample points as interior points. Additionally, under mild assumptions, the equivalence of the two lattice PWA approximations guarantees that the approximations are error-free in the domain of interest. Algorithms for deriving a statistically error-free approximation to the explicit linear MPC are proposed, and the complexity of the entire procedure is analyzed and shown to be polynomial in the number of samples. The performance of the proposed approximation strategy is tested through two simulation examples, and the results show that with a moderate number of sample points, we can construct lattice PWA approximations that are equivalent to the optimal control law of the explicit linear MPC.
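
As a rough illustration of the two representations named in the abstract, the sketch below evaluates a disjunctive form (a max over mins of affine pieces) and a conjunctive form (a min over maxes) for a toy one-dimensional saturated control law. The affine pieces, the index sets, and all function names are hypothetical; they are chosen by hand for this example and are not produced by the sampling procedure the paper describes.

```python
def affine(a, b):
    """Return the affine function x -> a . x + b for a coefficient list a."""
    return lambda x: sum(ai * xi for ai, xi in zip(a, x)) + b

# Three affine pieces of a saturated law u(x) = clip(x, -1, 1).
PIECES = [
    affine([0.0], -1.0),  # lower saturation, u = -1
    affine([1.0], 0.0),   # unsaturated region, u = x
    affine([0.0], 1.0),   # upper saturation, u = +1
]

def disjunctive(x, pieces, index_sets):
    """Disjunctive lattice PWA form: max over terms of the min of selected pieces."""
    return max(min(pieces[j](x) for j in idx) for idx in index_sets)

def conjunctive(x, pieces, index_sets):
    """Conjunctive lattice PWA form: min over terms of the max of selected pieces."""
    return min(max(pieces[j](x) for j in idx) for idx in index_sets)

# Hand-derived index sets for this toy law (assumed, not learned from data).
DISJ_SETS = [(0, 2), (1, 2)]  # max(min(-1, 1), min(x, 1))
CONJ_SETS = [(0, 1), (0, 2)]  # min(max(-1, x), max(-1, 1))

def saturation(x):
    """Reference law the two lattice forms should reproduce."""
    return max(-1.0, min(1.0, x[0]))
```

On this example both forms coincide with the saturated law at every state, which is the kind of agreement between the two approximations that the abstract uses as a certificate of exactness.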

approximation, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2110.00201

Country:

Asia (1.00)
North America > United States (0.93)

Genre: Research Report (0.84)

Industry: Energy > Oil & Gas > Downstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Compositionality of optimal control laws

Todorov, Emanuel

Neural Information Processing Systems | Feb-15-2020, 03:43:19 GMT

We present a theory of compositionality in stochastic optimal control, showing how task-optimal controllers can be constructed from certain primitives. The primitives are themselves feedback controllers pursuing their own agendas. They are mixed in proportion to how much progress they are making towards their agendas and how compatible their agendas are with the present task. The resulting composite control law is provably optimal when the problem belongs to a certain class. This class is rather general and yet has a number of unique properties - one of which is that the Bellman equation can be made linear even for non-linear or discrete dynamics.

bellman equation, compositionality, optimal control law, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence (0.70)
Information Technology > Control Systems (0.64)

Compositionality of optimal control laws

Todorov, Emanuel

Neural Information Processing Systems | Dec-31-2009

We present a theory of compositionality in stochastic optimal control, showing how task-optimal controllers can be constructed from certain primitives. The primitives are themselves feedback controllers pursuing their own agendas. They are mixed in proportion to how much progress they are making towards their agendas and how compatible their agendas are with the present task. The resulting composite control law is provably optimal when the problem belongs to a certain class. This class is rather general and yet has a number of unique properties - one of which is that the Bellman equation can be made linear even for non-linear or discrete dynamics. This gives rise to the compositionality developed here. In the special case of linear dynamics and Gaussian noise our framework yields analytical solutions (i.e. non-linear mixtures of linear-quadratic regulators) without requiring the final cost to be quadratic. More generally, a natural set of control primitives can be constructed by applying SVD to the Green's function of the Bellman equation. We illustrate the theory in the context of human arm movements. The ideas of optimality and compositionality are both very prominent in the field of motor control, yet they are hard to reconcile. Our work makes this possible.
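
The remark that the Bellman equation can be made linear can be illustrated on a small discrete problem. The sketch below assumes the linearly solvable setting in which the control cost is the KL divergence between controlled and passive transition probabilities, so that the desirability z = exp(-v) satisfies z(i) = exp(-q(i)) Σ_j p(j|i) z(j). The abstract does not state this formulation explicitly, and the chain, the state costs, and the function names are hypothetical.

```python
import math

def solve_desirability(P, q, terminal, iters=2000):
    """Fixed-point iteration for z(i) = exp(-q[i]) * sum_j P[i][j] * z(j)."""
    n = len(P)
    # Terminal states are assumed to carry zero final cost, so z = exp(-0) = 1 there.
    z = [1.0 if i in terminal else 0.0 for i in range(n)]
    for _ in range(iters):
        z = [
            z[i] if i in terminal
            else math.exp(-q[i]) * sum(P[i][j] * z[j] for j in range(n))
            for i in range(n)
        ]
    return z

def optimal_transitions(P, z, i):
    """Controlled transition law from state i, proportional to P[i][j] * z[j]."""
    weights = [P[i][j] * z[j] for j in range(len(z))]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical 5-state chain: unbiased random walk as passive dynamics, state 4 is the goal.
P = [
    [0.5, 0.5, 0.0, 0.0, 0.0],
    [0.5, 0.0, 0.5, 0.0, 0.0],
    [0.0, 0.5, 0.0, 0.5, 0.0],
    [0.0, 0.0, 0.5, 0.0, 0.5],
    [0.0, 0.0, 0.0, 0.0, 1.0],
]
q = [0.1, 0.1, 0.1, 0.1, 0.0]  # state cost per step, zero at the goal

z = solve_desirability(P, q, terminal={4})
v = [-math.log(zi) for zi in z]  # cost-to-go recovered from the desirability
```

Because the equation is linear in z, the solution for a weighted sum of final-cost desirabilities is the same weighted sum of the individual solutions, which is the property the compositionality result builds on.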

artificial intelligence, control law, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Minimal Intervention Principle for Coordinated Movement

Todorov, Emanuel, Jordan, Michael I.

Neural Information Processing Systems | Dec-31-2003

Behavioral goals are achieved reliably and repeatedly with movements rarely reproducible in their detail. Here we offer an explanation: we show that not only are variability and goal achievement compatible, but indeed that allowing variability in redundant dimensions is the optimal control strategy in the face of uncertainty. The optimal feedback control laws for typical motor tasks obey a "minimal intervention" principle: deviations from the average trajectory are only corrected when they interfere with the task goals. The resulting behavior exhibits task-constrained variability, as well as synergetic coupling among actuators--which is another unexplained empirical phenomenon.

control law, trajectory, variability, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Industry: Health & Medicine (0.47)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.48)
